NSF PAR Search | NSF Public Access Repository

DRAGON: Guard LLM Unlearning in Context via Negative Detection and Reasoning

Wang, Yaxuan; Liu, Quan; Liu, Chris Yuhao; Pang, Jinlong; Wei, Wei; Bao, Yujia; Liu, Yang (July 2025, ICML 2025 Workshop on Machine Unlearning for Generative AI)

Free, publicly-accessible full text available July 13, 2026
Improving Data Efficiency via Curating LLM-Driven Rating Systems

Pang, Jinlong; Wei, Jiaheng; Shah, Ankit; Zhu, Zhaowei; Wang, Yaxuan; Qian, Chen; Liu, Yang; Bao, Yujia; Wei, Wei (April 2025, The Thirteenth International Conference on Learning Representations)

Instruction tuning is critical for adapting large language models (LLMs) to downstream tasks, and recent studies have demonstrated that small amounts of human-curated data can outperform larger datasets, challenging traditional data scaling laws. While LLM-based data quality rating systems offer a cost-effective alternative to human annotation, they often suffer from inaccuracies and biases, even in powerful models like GPT-4. In this work, we introduce DS2, a Diversity-aware Score curation method for Data Selection. By systematically modeling error patterns through a score transition matrix, DS2 corrects LLM-based scores and promotes diversity in the selected data samples. Our approach shows that a curated subset (just 3.3% of the original dataset) outperforms full-scale datasets (300k samples) across various machine-alignment benchmarks, and matches or surpasses human-aligned datasets such as LIMA with the same sample size (1k samples). These findings challenge conventional data scaling assumptions, highlighting that redundant, low-quality samples can degrade performance and reaffirming that "more can be less."
Free, publicly-accessible full text available April 24, 2026
LLM Unlearning via Loss Adjustment with Only Forget Data

Wang, Yaxuan; Wei, Jiaheng; Liu, Chris Yuhao; Pang, Jinlong; Liu, Quan; Shah, Ankit; Bao, Yujia; Liu, Yang; Wei, Wei (April 2025, Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available April 24, 2026
Large language model unlearning via embedding-corrupted prompts

Liu, Chris; Wang, Yaxuan; Flanigan, Jeffrey; Liu, Yang (December 2024, Advances in Neural Information Processing Systems 37 (2024): 118198-118266)

Full Text Available
Large Language Model Unlearning via Embedding-Corrupted Prompts

Liu, Chris Yuhao; Wang, Yaxuan; Flanigan, Jeffrey; Liu, Yang (December 2024, 2024 Conference on Neural Information Processing Systems)

Large language models (LLMs) have advanced to encompass extensive knowledge across diverse domains. Yet controlling what a large language model should not know is important for ensuring alignment and thus safe use. However, accurately and efficiently unlearning knowledge from an LLM remains challenging due to the potential collateral damage caused by the fuzzy boundary between retention and forgetting, and the large computational requirements for optimization across state-of-the-art models with hundreds of billions of parameters. In this work, we present Embedding-COrrupted (ECO) Prompts, a lightweight unlearning framework for large language models to address both the challenges of knowledge entanglement and unlearning efficiency. Instead of relying on the LLM itself to unlearn, we enforce an unlearned state during inference by employing a prompt classifier to identify and safeguard prompts to forget. We learn corruptions added to prompt embeddings via zeroth order optimization toward the unlearning objective offline and corrupt prompts flagged by the classifier during inference. We find that these embedding-corrupted prompts not only lead to desirable outputs that satisfy the unlearning objective but also closely approximate the output from a model that has never been trained on the data intended for forgetting. Through extensive experiments on unlearning, we demonstrate the superiority of our method in achieving promising unlearning at nearly zero side effects in general domains and domains closely related to the unlearned ones. Additionally, we highlight the scalability of our method to 100 LLMs, ranging from 0.5B to 236B parameters, incurring no additional cost as the number of parameters increases. We have made our code publicly available at this https URL.
Full Text Available
Clustering of Diverse Multiplex Networks

https://doi.org/10.1109/TNSE.2024.3374102

Pensky, Marianna; Wang, Yaxuan (July 2024, IEEE Transactions on Network Science and Engineering)

Full Text Available
Practical Speedup of Bayesian Inference of Species Phylogenies by Restricting the Space of Gene Trees

https://doi.org/10.1093/molbev/msaa045

Wang, Yaxuan; Ogilvie, Huw A; Nakhleh, Luay; Harris, Kelley (February 2020, Molecular Biology and Evolution)

Species tree inference from multilocus data has emerged as a powerful paradigm in the postgenomic era, both in terms of the accuracy of the species tree it produces as well as in terms of elucidating the processes that shaped the evolutionary history. Bayesian methods for species tree inference are desirable in this area as they have been shown not only to yield accurate estimates, but also to naturally provide measures of confidence in those estimates. However, the heavy computational requirements of Bayesian inference have limited the applicability of such methods to very small data sets. In this article, we show that the computational efficiency of Bayesian inference under the multispecies coalescent can be improved in practice by restricting the space of the gene trees explored during the random walk, without sacrificing accuracy as measured by various metrics. The idea is to first infer constraints on the trees of the individual loci in the form of unresolved gene trees, and then to restrict the sampler to consider only resolutions of the constrained trees. We demonstrate the improvements gained by such an approach on both simulated and biological data.
Full Text Available
